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cy^ ; Abstract 
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'^ ■ We address the question of the growth of firm size. To this end, we an- 

Q . alyze the Compustat data base comprising all publicly-traded United States 

O I manufacturing firms within the years 1974-1993. We find that the distribu- 

tion of firm sizes remains stable for the 20 years we study, i.e., the mean 
. value and standard deviation remain approximately constant. We study the 

^vq ■ distribution of sizes of the "new" companies in each year and find it to be well 

^O , approximated by a log-normal. We find (i) the distribution of the logarithm 

pi I of the growth rates, for a fixed growth period of one year, and for companies 

^D I with approximately the same size S displays an exponential form, and (ii) the 

^«. ■ fluctuations in the growth rates — measured by the width of this distribution 

<Ti — scale as a power law with S, a\ ~ S~^ . We find that the exponent /? 
takes the same value, within the error bars, for several measures of the size of 
a company. In particular, we obtain: /3 = 0.20 it 0.03 for sales, /? = 0.18 it 0.03 
'^ \ for number of employees, /3 = 0.18 it 0.03 for assets, /? = 0.18 it 0.03 for cost 

Q ■ of goods sold, and /3 = 0.20 ± 0.03 for property, plant, & equipment. 
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I. INTRODUCTION 

Statistical physics has undergone many changes in emphasis during the last few decades. 
The seminal works of the '60s and '70s on critical phenomena ||l],H provided physicists with 
a new set of tools to study nature ||,^ . Fields such as biophysics, medicine, geomorphology, 
geology, evolution, ecology or meteorology are now common areas of application of statistical 
physics. 

In particular, several statistical physics research groups have turned their attention to 
problems in economics [§,§ and finance ||7|-[l5[|. On the other hand, the concepts of statistical 
physics (e.g., self-organization) have started to penetrate the study of economics [T^. In this 



article, we extend the study of Ref. on the growth rate of manufacturing companies. One 
motivation for the present study is the considerable recent interest in economics in developing 
a richer theory of the firm p!7|-|36|l . In standard microeconomic theory, a firm is viewed as a 
production function for transforming inputs such as labor, capital, and materials into output 
||T9| , |2^J30| . When dynamics are incorporated into the model, the link between production 
in one period and production in another arises because of investment in durable, physical 
capital and because of technological change (which in turn can arise from investments in 
research and development). Recent work on firm dynamics emphasizes the effect of how 
firms learn over time about their efficiency relative to competitors [^,^,^ . The production 
dynamics captured in these models are not, however, the only source of actual firm dynamics. 
Most notably, the existing models do not account for the time needed to assemble the 
organizational infrastructure needed to support the scale of production that typifies modern 
corporations. 

We studied all United States (US) manufacturing publicly-traded firms from 1974 to 
1993. The source of our data is Compustat which is a database on all publicly-traded firms 
in the US. Compustat obtains this information from reports that publicly traded companies 
must file with the US Securities and Exchange Commission. The database contains a large 
amount of information on each company. Among the items included are "sales," "cost of 
goods sold," "assets," "number of employees," and "property, plant, & equipment." 

Another item provided for each company is the Standard Industrial Classification (SIC) 
code. In principle, two companies in the same primary SIC code are in the same market; 
that is, they compete with each other. In practice, defining markets is extremely difficult 
p9| . More important for our analysis, virtually all modern firms sell in more than one 
market. Companies that operate in different markets do report some disaggregated data on 
the different activities. For example, while Philip Morris was originally a tobacco producer, 
it is also a major seller of food products (since its acquisition of General Foods) and of beer 
(since its acquisition of Miller Beer). Philip Morris does report its sales of tobacco products, 
food products, and beer separately. However, companies have considerable discretion in how 
to report information on their different activities, and differences in their choices make it 
difficult to compare the data across companies. 

In this paper, the only use we make of the primary SIC codes in Compustat is to restrict 
our attention to manufacturing firms. Specifically, we include in our sample all firms with 
a major SIC code from 2000-3999. We do not use the data from the individual business 
segments of a firm, nor do we divide up the sample according to primary SIC codes. We 
should acknowledge that this choice is at odds with the mainstream of economic analysis. 



In economics, what is commonly called the "theory of the firm" is actually a theory of a 
business unit. To build on the Philip Morris example, economists would likely not use a 
single model to predict the behavior of Philip Morris. At the very least, they would use 
one model for the tobacco division, one for the food division, and one for the beer division. 
Indeed, given the available data, they might construct a completely separate model of, say, 
the sales of Maxwell House coffee. Absent any effect of the output of one of Philip Morris' 
products on either the demand for or costs of its other products, the models of the different 
components of the firm would be completely separate. Because the standard model of the 
firm applies to business units, it does not yield any prediction about the distribution of the 
size of actual, multi-divisional firms or their growth rates. 

On the other hand, the approach we take in this study is part of a distinguished tradition. 
First, there is a large body of work by Economics Nobel laureate H. Simon [^ and various 
co-authors that explored the stochastic properties of the dynamics of firm growth. Also, in 
a widely cited article (that nonetheless has not had much impact on mainstream economic 
analysis), R. Lucas, also a Nobel laureate, suggests that the distribution of firm size depends 
on the distribution of managerial ability in the economy rather than on the factors that 
determine size in the conventional theory of the firm ||26|| . 

In summary, the objective of our study is to uncover empirical scaling regularities about 
the growth of firms that could serve as a test of models for the growth of firms. We find: (i) 
the distribution of the logarithm of the growth rates for firms with approximately the same 
size displays an exponential form, and (ii) the fluctuations in the growth rates — measured 
by the width of this distribution — scale as a power law with firm size. 

The paper is organized as follows: In Sect. II, we review the economics literature on the 
growth of companies. In Sects. Ill and IV, we present our empirical results for publicly- 
traded US manufacturing companies. Finally, in Sect. V, we present concluding remarks 
and discuss questions raised by our results. 

II. BACKGROUND 

In 1931, the French economist Gibrat proposed a simple model to explain the empirically 
observed size distribution of companies [jl^. He made the following assumptions: (i) the 
growth rate i? of a company is independent of its size (this assumption is usually referred 
to by economists as the law of proportionate effect), (ii) the successive growth rates of a 
company are uncorrelated in time, and (iii) the companies do not interact. 

In mathematical form, Gibrat's model is expressed by the stochastic process: 

St+At = St{l + et), (1) 

where St+At and St are, respectively, the size of the company at times (t -|- At) and t, and 
et is an uncorrelated random number with some bounded distribution and variance much 
smaller than one (usually assumed to be Gaussian) . Hence log St follows a simple random 
walk and, for sufficiently large time intervals u ^ At, the growth rates 

Ru^^ (2) 



are log-normally distributed. If we assume that all companies are born at approximately the 
same time and have approximately the same initial size, then the distribution of company 
sizes is also log-normal. This prediction from the Gibrat model is approximately correct 

MM- 



There is, however, considerable evidence that contradicts Gibrat's underlying assump- 
tions. The most striking deviation is that the fluctuations of the growth rate measured by 
the relative standard deviation cri{S) decline with an increase in firm size. This was first 
observed by Singh and Whittington [Q and confirmed by others |]6|j43|-^. The negative 



relationship between growth fluctuations and size is not surprising because large firms are 
likely to be more diversified. Singh and Whittington state that the decline of the standard 
deviation with size is not as rapid as if the firms consisted of independently operating sub- 
sidiary divisions. The latter would imply that the relative standard deviation decays as 
(Ji{S) ~ S*"^/^ [^. This confirms the common-sense view that the performance of different 



parts of a firm are related to each other. 

The situation for the mean growth rate is less clear. Singh and Whittington [^ consider 



the assets of firms and observe that the mean growth rate increases slightly with size. 



However, the work of Evans |43] and Hall [44], using the number of employees to define 



the company's size, suggests that the mean growth rate declines slightly with size. Dunne 
et al. [0 emphasize the effect of the failure rate of firms and the effect of the ownership 
status (single- or multi-unit firms) on the relation between size and mean growth rate. They 
conclude that the mean growth rate is always negatively related with size for single-unit 
firms; but for multi-unit firms, the growth rate increases modestly with size because the 
reduction in their failure rates overwhelms a reduction in the growth of nonfailing firms PS 



Another testable implication of Gibrat's law is that the growth rate of a firm is uncor- 
related in time. However, the empirical results in the literature are not conclusive. Singh 
and Whittington |^ observe positive first order correlations in the 1-year growth rate of a 
company (persistence of growth) whereas Hall [^ finds no such correlations. The possibility 



of negative correlations (regression towards the mean) has also been suggested p8| , ^ 

III. SIZE DISTRIBUTION OF PUBLICLY-TRADED COMPANIES 

In the following sections, we study the distribution of company sizes and growth rates. 
To do so, one problem that must be confronted is the definition of firm size. If all companies 
produced the same good (steel, say), then we could use a physical measure of output, such 
as tons. We are, however, studying companies that produce different goods for which there 
is no common physical measure of output. An obvious solution to the problem is to use the 
dollar value of output: the sales. A general alternative to measuring the size of output is to 
measure input. Again, since companies produce different goods, they use different inputs. 
However, virtually all companies have employees. As a result, some economists have used 
the number of employees as a measure of firm size. Three other possibilities involve the 
dollar value of inputs, such as the "cost of goods sold," "property, plant & equipment," 
or "assets." As we discuss below, we obtain similar results for all of these measures. We 
begin by describing the growth rate of sales. To make the values of sales in different years 
comparable, we adjust all values to 1987 dollars by the GNP price deflator. 
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FIG. 1. Number of publicly-traded manufacturing companies in the US for the period 
1974-1993 (right scale). Also shown is the number of companies entering the market and the 
number of companies leaving (left scale). 

Since the law of proportionate effect implies a multiplicative process for the growth of 
companies, it is natural to study the logarithm of sales. We thus define 



and the corresponding growth rate 



ri 



So = In 5*0 
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where 5*0 is the size of a company in a given year and Si its size the following year. 

Stanley et al. determined the size distribution of publicly-traded manufacturing compa- 
nies in the US [0. They found that for 1993, the data fit to a good degree of approximation 
a log-normal distribution. These results have been recently confirmed by Hart and Oulton 
^ for a sample of approximately 80000 United Kingdom companies. Here, we present a 
study of the distribution for a period of 20 years (from 1974 to 1993). 

Figure shows the total number of publicly-traded manufacturing companies present 
in the database each year. We also plot the number of new companies and of ^^ dying" 
companies (i.e., companies that leave the database because of merger, change of name or 
bankruptcy). 



Figure 2(a) shows the distribution of firm size in each year from 1974-1993. Particularly 
above the lower tails, the distributions lie virtually on top of each other. 

Fig. 2(a) 
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Fig. 2(c) 
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FIG. 2. (a) Probability density of the logarithm of the sales for publicly-traded manufacturing 
companies (with standard industrial classification index of 2000-3999) in the US for each of the 
years in the 1974-1993 period. All the values for sales were adjust to 1987 dollars by the GNP price 
deflator. Also shown (solid circles) is the average over the 20 years. It is visually apparent that the 
distribution is approximately stable over the period, (b) Probability density of the logarithm of 
sales for all the manufacturing companies, for the companies entering the market (shifted by a factor 
of 1/10), and for the companies leaving the market (shifted by a factor of 1/100), averaged over 
the 1974-1993 period. The distribution of new companies can be described to first approximation 
by a log-normal while the other distributions are better fitted by the exponential of a third order 
polynomial. Notice that the distributions of all companies and of dying companies are nearly 
identical. This suggests a nearly constant dependence of the dying probability on size, (c) Plot 
of the fraction of "dying" companies by size. We define this probability as the ratio of dying 
companies of a given size over the total number of companies of that size. The horizontal straight 
line is a guide for the eye for companies with sales above 10®. 

Thus the distribution is stable over this period. This is surprising because there is no 
existing theoretical reason to expect that the size distribution of firms could remain stable 
value slightly smaller than the average of all companies. One might as the economy grows, 
as the composition of output changes, and as factors that economists would expect to affect 
firm size (like computer technology) evolve. It is also important because it contradicts 
the predictions of the Gibrat model. Equation (|lD implies that the distribution of sizes of 
companies should get broader with time. In fact, the variance of the distribution should 
increase linearly in time. Thus, we must conclude that other factors, not included in Gibrat 's 
assumptions, must have important roles. 



One obvious factor not captured by the Gibrat assumption is the entry of new companies. 
Figure 2(b) shows that the size distribution of new pubhcly-traded companies is approxi- 
mately a log-normal with an average expect new companies to be much smaller on average 
than existing ones. However, new companies can come about through the merger of two 
existing companies, in which case the new company is bigger than either of the pre-existing 
companies. Another way that new companies come into existence is that very large compa- 
nies divest themselves of divisions that are, by themselves, large businesses. An example is 
AT&T's recent divestiture of its manufacturing division (Lucent) and its computer division 
(NCR). 

Another factor not included in Gibrat 's assumptions is the "dying" of companies. As 
shown in Fig. §3, this distribution is quite similar to the distribution for all companies. 
Thus, it suggests that the probability for a company to leave the market, whether by merger, 
change of name, or bankruptcy, is nearly independent of size. Fig. ^. 

When analyzing the data, it is important to consider the high level of the noise in the 
tails. In building a histogram from the data, the most straightforward method is to use 
equally spaced bins. However, doing so creates noisy results in the tails because of the small 
number of data points in these regions. One way to solve this problem, especially if some 
knowledge of the shape of the distribution exists, is to take bins chosen with such lengths 
that all of them receive approximately the same number of data points. In fact, we used 
equally spaced bins on a logarithmic scale, i.e., all firms with sales values falling into an 
interval between 8'^ and 8^^+^ with k an integer belong to one bin. 

IV. THE DISTRIBUTION OF GROWTH RATES 

The distribution p(ri|so) of the growth rates from 1974 to 1993 is shown in Fig. ^ for 
three different values of the initial sales. Remarkably, these curves can be approximated by 
a simple "tent-shaped" form. Hence the distribution is not Gaussian — as expected from 
the Gibrat approach [1^ — but rather is exponential 0, 



P{ri\so) = r- exp . (5) 

V2(Ti(so) V ^i(«o) / 

The straight lines shown in Fig. ^ are calculated from the average growth rate fi(so) and the 
standard deviation cri(so) obtained by fitting the data to Eq. (|^). The tails of the distribution 
in Fig. ^ are somewhat fatter than Eq. (|^) predicts. This deviation is the opposite of what 
one would find if the distribution were Gaussian. We find that the data for each annual 
interval from 1974-1993 also fit well to Eq. (^, with only small variations in the parameters 
fi(so) and Oxis^. 
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FIG. 3. Probability density p(ri|so) of the growth rate r = ln(5i/5o) for all publicly-traded 
US manufacturing firms in the 1994 Compustat database with Standard Industrial Classification 
index of 2000-3999. The distribution represents all annual growth rates observed in the 19-year 
period 1974-1993. We show the data for three different bins of initial sales (with sizes increasing 
by powers of 8): 8"^ < 5o < 8*^, 8^ < 5o < 8^ and 8^ < 5o < 8^°. Within each sales bin, each firm 
has a different value of i?, so the abscissa value is obtained by binning these R values. The solid 
lines are exponential fits to the empirical data close to the peak. We can see that the wings are 
somewhat "fatter" than what is predicted by an exponential dependence. 

A. Mean growth rate 

Economists typically have studied the relationship between mean growth rate and firm 
size by running a regression of growth rates on firm size sometimes with other control 
variables included. Rather than using regression analysis, we undertake a graphical analysis 
of the mean growth rate. Figure ^(a) displays f(so) as a function of initial size Sq for 
several years. Although the data are quite noisy, they suggest that there is no significant 
dependence of the mean growth rate on sq. Least squares fits of the individual curves to 
a form f(so) ~ Sq lead to estimates of the proportionality constant which are very small 
in magnitude (< 10~^), and whose sign can be positive or negative depending on the year. 
Our analysis suggests that if a dependence exists, it is very weak for any range of sizes 
where other factors, such as a bias of the sample towards successful companies, could be 
disregarded. 

The analysis for the average of the nineteen 1-year periods, which is displayed in Fig.^(b), 
confirms this observation. Furthermore, the figure suggests that the results do not change 
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when we consider other definitions of the size of a company. 

Fig. 4(a) 
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FIG. 4. (a) Mean 1-year growth rate fi(so) for several years. It is visually apparent that 
the data are quite noisy, and that there is no significant dependence on ^o (at most a logarithmic 
dependence with a very small coefficient). Also displayed is the mean growth rate for the 18-year 
period in Compustat. (b) Average for the 19 years of fi(so) for several size definitions: sales, assets, 
cost of goods sold and plant property and equipment. Error bars corresponding to one standard 
deviation are shown for sales — values for the other quantities are nearly identical. Again, no 
significant dependence on Sq is found. Although it seems likely that the slightly positive value of 
f(so) is a real effect, we cannot rule out the possibility of a bias of the data towards successful 
companies, (c) Standard deviation of the 1-year growth rates for different definitions of the size 
of a company as a function of the initial values. Least squares power law fits were made for all 
quantities leading to the estimates of /3: 0.18 ±0.03 for "assets," 0.20 ±0.03 for "sales," 0.18 ±0.03 
for "number of employees," 0.18±0.03 for "cost of goods sold," and 0.20±0.03 for "plant, property 
&: equipment." The straight lines are guides for the eye and have slopes 0.19 



B. Standard deviation of the growth rate 



Next, we study the dependence of o"i(so) on sq. As is apparent from Figs. |]-|, the width 
of the distribution of growth rates decreases with increasing sq. We find that cri(so) is well 
approximated for 8 orders of magnitude (from sales of less than 10^ dollars up to sales of 
more than 10^^ dollars) by the law 



cri(so) ~exp(-/?So), 
where /3 = 0.20 ± 0.03. Equation (Bl) implies the scaling law 



(6) 
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(^i{So) ~ 5'o 



(7) 



Figure ^(c) displays ai vs. So, and we can see that Eq. (0) is indeed verified by the data. 

C. Other Measures of Size 

In order to test further the robustness of our findings, we perform a parallel analysis for 
the number of employees. We find that the analogs of p(ri|so) and cri(so) behave similarly. 
For example, Fig. ^(c) shows the standard deviation of the number of employees, and we 
see that the data are linear over roughly 5 orders of magnitude, from firms with less than 
10 employees to firms with almost 10^ employees. The slope /3 = 0.18 ± 0.03 is the same, 
within the error bars, as found for the sales. 

As shown in Fig. §(c), we find that Eqs. (H) and (0) approximately describe three addi- 
tional indicators of a company's size, (i) assets (with exponent P = 0.18 ± 0.03) (ii) cost of 
goods sold {P = 0.18 =t 0.03) and (iii) property, plant & equipment (/3 = 0.20 ± 0.03). 

V. DISCUSSION 

What is remarkable about Eqs. (|]) and (^ is that they approximate the growth rates of 
a diverse set of firms. They range not only in their size but also in what they manufacture. 
The conventional economic theory of the firm is based on production technology, which 
varies from product to product. Conventional theory does not suggest that the processes 
governing the growth rate of car companies should be the same as those governing, e.g., 
pharmaceutical or paper firms. 

Fig. 5(a) 
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Fig. 5(b) 
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FIG. 5. (a) Dependence of ai on 5*0 for two subsets of the data corresponding to different 
values of the SIC codes. In principle, companies in different subsets operate in different markets. 
The figure suggests that our results are universal across markets, (b) Scaled probability density 
Pscai = V^(^i{so)p{ri\so) as a function of the scaled growth rate rgcai = V^i^^i — ^i(so)]/o'i(so). The 
values were rescaled using the measured values of fi(so) and ai(so). All the data collapse upon 
the universal curve Pscal = exp(— |rscal|) as predicted by Eqs. (^) and (|6|). 

Indeed, our findings are reminiscent of the concept of universality found in statistical 
physics, where different systems can be characterized by the same fundamental laws, inde- 
pendent of "microscopic" details. Thus, we can pose the question of the universality of our 
results: Is the measured value of the exponent (3 due to some averaging over the different 
industries, or is it due to a universal behavior valid across all industries? As a "robustness 
check," we split the entire sample into two distinct intervals of SIC codes. It is visually 
apparent in Fig. 5(a) that the same behavior holds for the different samples of industries. 

In statistical physics, scaling phenomena of the sort that we have uncovered in the 
sales and employee distribution functions are sometimes represented graphically by plotting 
a suitably "scaled" dependent variable as a function of a suitably "scaled" independent 
variable. If scaling holds, then the data for a wide range of parameter values are said to 
"collapse" upon a single curve. To test the present data for such data collapse, we plot in 
Fig. 11(b) the scaled probability density Pscai = V^o'{so)p{ri\so) as a function of the scaled 



13 



growth rates of both sales and employees rgcai = \/^[^i — ^i(so)]/o'(so)- The data collapse 
relatively well upon the single curve Pscai = exp(— |rscai|)- Our results for (i) cost of goods 
sold, (ii) assets, and (iii) property, plant & equipment are equally consistent with such 
scaling. The high degree of similarity in the behavior of sales, the number of employees, and 
of the other measures of size that we studied points to the existence of large correlations 
among those quantities, as one would expect. 

In summary, we study publicly-traded US manufacturing companies from 1974 to 1993. 
We find that the distribution of the logarithms of the growth rate decays exponentially. 
Furthermore, we observe that the standard deviation of the distribution of growth rates 
scales as a power law with the size S of the company. Our results support the possibility 
that the scaling laws used to describe complex but inanimate systems comprised of many 
interacting particles (as occurs in many physical systems) may be usefully extended to 
describe complex but animate systems comprised of many interacting subsystems (as occurs 
in economics). Furthermore, the kind of scaling laws found in this study can be viewed 
as empirical evidence supporting some hypothesis regarding the self-organization of the 
economy 
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